Live freelance tracking. Raw descriptions turned into structured data. Find your next tech project without the noise.
upwork.com | 2026-05-12
Collect senior care licensing and compliance data from US state government websites
Client: USA, member since 2025-10-24
Price: $150
Problem: Identify and structure comprehensive licensing and regulatory data for various senior care providers across multiple U.S. states.
Existing: Not specified
Specifications:
[Target] California (CA), Arizona (AZ), Nevada (NV), Hawaii (HI), North Carolina (NC), Illinois (IL), Florida (FL), Texas (TX), Washington (WA), Colorado (CO), Pennsylvania (PA), New York (NY)
[Method] Web research, URL identification, data extraction, CSV export
[UI/UX] Not applicable
[Stack] Python, BeautifulSoup, Pandas, Openpyxl, Requests
[Security] Data encryption during transfer and storage, secure API access
[Format] CSV files, JSON for APIs
Workflow:
1. Research each state's primary licensing/regulatory agency URL.
2. Identify public license lookup/search portal URLs.
3. Find direct dataset/spreadsheet download URLs or preferred API endpoints.
4. Locate inspection, deficiency, enforcement, and violations data sources.
5. Document bulk downloads, open data portals, or developer APIs.
6. Extract provider/facility name, address, contact information, license number/status, ownership/operator details, inspection dates, violations/deficiencies, enforcement actions, bed counts/capacity, Medicare/Medicaid identifiers.
7. Validate completeness and structure of collected data.
8. Export structured data as CSV files for each state.
9. Compile a master summary document listing all URLs, agency names, portal/API details, download methods, and notes on data limitations or scraping requirements.
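
Steps 6 to 8 (extract, validate, export per state) could be sketched roughly as follows. This is a minimal illustration, not the client's required implementation: the column names in `FIELDS`, the `REQUIRED` subset, and the sample rows are all hypothetical, derived only from the field list in step 6. It uses the standard-library `csv` module instead of Pandas so that it runs without extra dependencies.

```python
import csv
import io
from collections import defaultdict

# Hypothetical column names based on the fields listed in step 6.
FIELDS = [
    "state", "facility_name", "address", "phone", "license_number",
    "license_status", "operator", "last_inspection_date",
    "violations", "enforcement_actions", "bed_count", "medicare_medicaid_id",
]
# Assumed minimum for a usable row; adjust per state as needed.
REQUIRED = ("state", "facility_name", "license_number")


def validate(record):
    """Return a list of problems found in one record (empty list means OK)."""
    problems = [
        f"missing {name}"
        for name in REQUIRED
        if not str(record.get(name) or "").strip()
    ]
    beds = str(record.get("bed_count") or "").strip()
    if beds and not beds.isdigit():
        problems.append("bed_count is not a whole number")
    return problems


def export_by_state(records):
    """Group valid records by state and render one CSV text per state.

    Returns (csv_by_state, rejected). Each rejected entry pairs a bad
    record with its problems, which can feed the data-limitation notes.
    """
    grouped, rejected = defaultdict(list), []
    for record in records:
        problems = validate(record)
        if problems:
            rejected.append((record, problems))
        else:
            grouped[record["state"]].append(record)

    csv_by_state = {}
    for state, rows in grouped.items():
        buffer = io.StringIO()
        # restval fills columns a source does not provide; extra keys are dropped.
        writer = csv.DictWriter(
            buffer, fieldnames=FIELDS, restval="", extrasaction="ignore"
        )
        writer.writeheader()
        writer.writerows(rows)
        csv_by_state[state] = buffer.getvalue()
    return csv_by_state, rejected


# Placeholder rows for illustration only; not real facility data.
sample = [
    {"state": "CA", "facility_name": "Example Care Home",
     "license_number": "000000", "bed_count": "6"},
    {"state": "AZ", "facility_name": "", "license_number": "111111"},
]
csv_by_state, rejected = export_by_state(sample)
```

Each value in `csv_by_state` can then be written to its own file (for example a hypothetical `CA.csv`), and the `rejected` list gives a starting point for the "notes on data limitations" in the master summary.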